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Abstract 

Computing  discrete  two-dimensional  convolutions  is  an  important  problem  in  image  pro¬ 
cessing.  In  mathematical  morphology,  an  important  variant  is  that  of  computing  binary 
convolutions,  where  the  kernel  of  the  convolution  is  a  0-1  valued  function.  This  operation 
can  be  quite  costly,  especially  when  large  kernels  are  involved.  In  this  paper,  we  present 
an  algorithm  for  computing  convolutions  of  this  form,  where  the  kernel  of  the  binary 
convolution  is  derived  from  a  convex  polygon.  Because  the  kernel  is  a  geometric  object, 
we  allow  the  algorithm  some  flexibility  in  how  it  elects  to  digitize  the  convex  kernel  at 
each  placement,  as  long  as  the  digitization  satisfies  certain  reasonable  requirements.  We 
say  that  such  a  convolution  is  valid.  Given  this  flexibility  we  show  that  it  is  possible  to 
compute  binary  convolutions  more  efficiently  than  would  normally  be  possible  for  large 
kernels.  Our  main  result  is  an  algorithm  which,  given  an  m  X  n  image  and  a  fc-sided 
convex  polygonal  kernel,  computes  a  valid  convolution  in  0{kmn)  time.  Unlike  standard 
algorithms  for  computing  correlations  and  convolutions,  the  running  time  is  independent 
of  the  area  or  perimeter  of  A,  and  our  techniques  do  not  rely  on  computing  fast  Fourier 
transforms.  Our  algorithm  is  based  on  a  novel  use  of  Bresenham’s  line-drawing  algo¬ 
rithm  and  prefix-sums  to  update  the  convolution  efficiently  as  the  kernel  is  moved  from 
one  position  to  another  across  the  image. 
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Abstract 

Computing  discrete  two-dimensional  convolutions  is  an  important  problem  in  image  pro¬ 
cessing.  In  mathematical  morphology,  an  important  variant  is  that  of  computing  binary 
convolutions,  where  the  kernel  of  the  convolution  is  a  0-1  valued  function.  This  operation 
can  be  quite  costly,  especially  when  large  kernels  are  involved.  In  this  paper,  we  present 
an  algorithm  for  computing  convolutions  of  this  form,  where  the  kernel  of  the  binary 
convolution  is  derived  from  a  convex  polygon.  Because  the  kernel  is  a  geometric  object, 
we  allow  the  algorithm  some  flexibility  in  how  it  elects  to  digitize  the  convex  kernel  at 
each  placement,  as  long  as  the  digitization  satishes  certain  reasonable  requirements.  We 
say  that  such  a  convolution  is  valid.  Given  this  flexibility  we  show  that  it  is  possible  to 
compute  binary  convolutions  more  efficiently  than  would  normally  be  possible  for  large 
kernels.  Our  main  result  is  an  algorithm  which,  given  an  m  X  n  image  and  a  fc-sided 
convex  polygonal  kernel,  computes  a  valid  convolution  in  0(kmn)  time.  Unlike  standard 
algorithms  for  computing  correlations  and  convolutions,  the  running  time  is  independent 
of  the  area  or  perimeter  of  A,  and  our  techniques  do  not  rely  on  computing  fast  Fourier 
transforms.  Our  algorithm  is  based  on  a  novel  use  of  Bresenham’s  line-drawing  algo¬ 
rithm  and  prehx-sums  to  update  the  convolution  efficiently  as  the  kernel  is  moved  from 
one  position  to  another  across  the  image. 
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the  National  Aeronautics  and  Space  Administration  under  Contract  NAS  5555-37. 


1  Introduction 


A  fundamental  problem  in  image  processing  is  that  of  computing  discrete  convolutions 
[7,  9,  4],  Consider  an  image  which  is  given  as  a  2-dimensional  m  X  n  array  I  of  real 
numeric  values.  We  will  think  of  the  image  as  dehning  a  function  /  :  >  R,  where 


if  1  ^  ^  and  1  <  <  nr, 

otherwise. 


(The  transposition  of  and  gy  reflects  the  convention  that  arrays  are  indexed  hrst  by 
row,  then  by  column.)  The  kernel  of  the  convolution  is  a  o'  x  r  image  array  K.  The 
discrete  convolution  [7]  of  I  with  if,  denoted  by  /  *  if,  is  dehned  to  be 

{I  *  K)[x,y]  =  '^Yl  -b]. 

a  h 


It  is  common  to  embed  I  and  if  within  larger  images  to  avoid  wraparound  effects,  but 
this  will  not  be  of  concern  to  us  here. 

A  binary  convolution  is  a  special  case  of  a  discrete  convolution  where  if  is  a  0-t  valued 
function.  Binary  convolutions  are  of  particular  interest  in  computational  morphology  and 
digital  geometry  [15,  16].  For  example,  the  dilation  of  a  digital  shape,  described  by  a 
0-1  image  /,  by  a  digital  kernel  if  described  by  another  such  image  can  be  expressed  by 
computing  the  convolution  I *(— if),  and  then  thresholding  this  image  so  that  all  positive 
values  are  mapped  to  1.  Binary  convolutions  are  also  useful  in  template  matching  [7]  in 
binary  images,  through  the  use  of  the  related  correlation  operation.  Our  results  apply  to 
computing  binary  correlations  as  well.  Binary  convolutions  have  the  following  geometrical 
interpretation:  Let  —if  denote  the  set  of  points  {—p  |  p  G  if }.  We  can  interpret  I  *  K{p) 
as  placing  a  copy  of  —if  at  location  p  of  the  image,  and  then  computing  the  number  of 
pixels  that  are  overlapped  by  —if. 

One  problem  with  computing  discrete  convolutions  is  that  the  operation  can  be  quite 
expensive  when  the  kernel  of  the  convolution  is  large.  A  naive  algorithm  for  computing 
the  convolution  considers  each  placement  of  the  q  X  r  kernel,  and  computes  the  weighted 
sum  in  0{qr)  time.  Since  there  are  mn  possible  placements,  this  results  in  an  algorithm 
whose  running  time  is  0{mnqr).  Here  we  assume  that  q  <  m  and  r  <  n,  but  these 
quantities  may  still  be  large.  The  question  is  whether  we  can  improve  on  the  qr  factor, 
especially  when  qr  is  large. 

One  approach  is  based  on  the  idea  of  decomposing  a  convolution  involving  a  large 
kernel  as  a  sequence  of  convolutions  involving  small  kernels  [It,  14].  In  the  case  of  binary 
morphology,  search  algorithms  have  been  proposed  to  decompose  the  kernels  as  dilations 
of  two-point  structuring  elements  [18].  The  basic  two-point  search  algorithm  was  also 
extended  to  grayscale  operations  [3].  However,  many  shapes  such  as  triangles  are  not  two- 
point  decomposable  and  so  the  two-point  decomposition  algorithms  cannot  be  applied 
to  them.  When  the  kernel  shape  is  restricted  to  shapes  formed  by  intersections  of  half¬ 
planes  at  multiples  of  45  degrees,  it  has  been  shown  that  the  kernel  can  be  decomposed 
as  dilations  of  kernels  from  a  basis  set  [10].  This  decomposition  method  has  not  been 
generalized  to  discrete  polygons  with  sides  at  arbitrary  angles. 
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The  concepts  of  Singular  Value  Decomposition  (SVD)  and  Small  Generating  Kernel 
(SGK)  have  been  used  to  speed  up  the  grayscale  convolution  processing  time  by  decom¬ 
posing  the  kernel  into  separable  biters  and  then  decomposing  each  separable  biter  as  a 
sequence  of  SGK  biters  [12],  The  speedup  is  obtained  by  using  the  kernels  corresponding 
to  the  larger  eigenvalues.  The  SVD/SGK  methods,  however,  are  useful  only  in  the  case 
of  grayscale  convolutions. 

Other  methods  involve  using  table  lookup  to  avoid  the  cost  of  multiplication  [5,  17]. 
Burt  proposed  a  technique  based  on  the  use  of  quadtrees  [2].  However,  these  methods 
improve  running  times  only  by  a  constant  factor.  A  common  approach  to  computing 
convolutions  for  large  kernels  is  brst  to  compute  the  Fourier  transforms  of  the  image 
and  kernel,  denoted  by  I'  and  K' .  Then  the  convolution  I  *  K  can  be  approximated 
in  0{mn)  time  by  computing  the  elementwise  product  I'  ■  K' ,  and  then  inverting  the 
transform  [7].  This  approach  requires  only  O(mnlog(mn))  time,  which  is  a  signibcant 
saving.  However,  for  morphological  and  other  discrete  applications,  it  has  the  inelegant 
property  of  converting  an  exact  discrete  problem  into  a  continuous  problem. 

In  this  paper  we  consider  a  signibcantly  different  approach.  We  consider  the  problem 
of  computing  binary  convolutions  where  the  kernel  of  the  convolution  is  derived  from  a 
convex  polygon.  We  introduce  the  notion  of  a  valid  digitization  of  a  geometric  shape. 
We  present  formal  debnitions  later,  but  intuitively,  a  digitization  is  valid  if  pixels  lying 
entirely  inside  the  shape  are  in  the  digitization  and  pixels  lying  entirely  outside  the  shape 
are  not  in  the  digitization.  We  then  debne  the  notion  of  a  valid  convolution,  which  is 
based  on  using  valid  digitizations  of  the  kernel  to  perform  the  convolution.  Different 
placements  of  the  kernel  are  allowed  to  use  different  digitizations.  We  show  that  with 
this  added  bexibility  it  is  possible  to  compute  digitizations  for  convex  polygonal  kernels 
in  time  that  is  independent  of  the  area  or  perimeter  of  the  kernel.  In  particular,  we  show 
that  a  valid  convolution  of  an  m  X  n  image  with  a  fc-sided  convex  polygonal  kernel  can  be 
computed  in  0(kmn)  time  and  0{mn)  space.  This  type  of  convolution  is  of  interest  in 
morphology  applications,  where  the  operator  is  a  convex  polygon  or  can  be  approximated 
by  one.  If  k  is  small,  this  can  be  signibcantly  faster  than  existing  approaches  for  large 
kernels.  The  most  closely  related  work  to  ours  is  that  of  box-bltering  [f3].  However, 
box-bltering  is  limited  to  rectangular  shapes.  Our  approach  is  a  generalization  of  the 
box-bltering  method  to  non- rectangular  convex  polygons. 

2  Definitions  and  Notation 

We  begin  with  a  number  of  debnitions.  Let  7?  denote  the  set  of  ordered  pairs  of  integers, 
called  grid  points.  Given  g  ^  debne  TT{g)  to  be  a  half-open  unit  square  centered  at  g, 
called  g^s  pixel.  The  set  of  TT{g)  for  all  g  ^  Z^  subdivides  the  real  plane  into  a  collection 
of  unit  squares  with  pairwise  disjoint  interiors. 

Let  denote  the  set  of  ordered  pairs  of  reals.  Given  a  set  P  C  R^  and  t  G  R^,  let 
t  P  denote  the  translate  of  P  by  t,  that  is, 

t  +  P  =  {t+p\pe  P}. 

We  will  call  this  the  placement  of  P  at  t.  Let  —P  =  {—p  \  p  G  P},  and  debne  t  —  P  to 
be  f  +  (— P).  Given  a  set  P  C  R^,  we  are  interested  in  ways  of  mapping  P  into  a  set 
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of  grid  points,  called  a  digitization.  We  say  that  a  digitization  D{P)  C  7?  is  valid  if  for 
every  pixel  that  lies  entirely  within  P  the  corresponding  grid  point  is  in  the  digitization, 
and  for  every  pixel  that  is  entirely  disjoint  from  P  the  corresponding  grid  point  is  not  in 
the  digitization.  More  formally. 


7r(5)cp  ^  geD{P) 

(7r(5)np  =  0)  ^  g^DiP). 

Pixels  that  partially  overlap  P  may  or  may  not  be  in  a  valid  digitation.  An  example  of 
a  valid  digitization  is  the  midpoint  digitization,  denoted  by  D'^{P),  which  consists  of  all 
grid  points  that  he  within  P.  Fig.  t  shows  valid  digitizations  of  three  different  placements 
of  the  same  polygon  P.  The  one  in  the  center  is  the  midpoint  digitization. 


Fignre  t:  Valid  digitizations. 

Given  an  image  I  and  any  set  of  grid  points  G  C  Z^,  dehne  the  weight  of  G  relative 
to  I  to  be  the  snm  of  the  image  valnes  of  G: 

^(G)  =  %)• 

geG 

Let  iL  be  a  convex  polygon  in  the  plane,  and  let  I  be  an  image.  We  dehne  the 
midpoint  convolution  of  /  by  iL  to  be  an  m  X  n  image  G,  where  C{t)  is  dehned  to  be  the 
weight  of  the  midpoint  digitization  of  —K  placed  at  t,  that  is, 

C(t)  =  Y.  Ha)- 

K  is  called  the  kernel  of  the  convolntion.  A  valid  convolution  of  /  by  iL  is  dehned 
in  the  same  way,  bnt  where  D'^[t  —  K)  is  replaced  by  any  valid  digitization  of  t  — 
K .  Note  that  becanse  valid  digitizations  are  not  nniqne,  different  placements  may  be 
digitized  differently.  The  midpoint  and  valid  convolntion  are  generalizations  of  the  binary 
convolntion. 

Onr  main  resnlt  is  the  following. 

Theorem  1  Given  an  mxn  image  I  and  a  k-sided  convex  polygon  K ,  a  valid  convolution 
of  I  by  K  can  he  computed  in  0(kmn)  time  and  0{mn)  space. 
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Henceforth,  to  avoid  the  continual  need  for  negations,  we  will  assume  that  the  kernel 
for  the  convolution  is  —K,  and  so  each  element  of  the  convolution  is  the  weight  of  some 
translate  of  K .  We  assume  that  the  image  is  a  two-dimensional  rectangular  array  with  m 
rows  and  n  columns.  The  pixels  of  the  image  form  a  subdivision  of  the  image  rectangle 
into  squares: 

/?  =  {(x,  y)  I  0.5  <  X  <  n  +  0.5,  0.5  <  y  <  m  +  0.5}. 

All  grid  points  outside  this  rectangle  are  assumed  to  have  value  0.  We  assume  that  the 
kernel  K  is  dehned  by  a  counterclockwise  cyclic  list  of  its  vertices. 

3  The  Algorithm 

We  hrst  give  an  overview  of  the  algorithm  for  computing  a  valid  convolution.  Let  k 
denote  the  number  of  sides  of  the  kernel  A,  and  recall  that  m  and  n  denote  the  height 
and  width,  respectively,  of  the  image.  The  algorithm  consists  of  a  number  of  stages. 
The  hrst  involves  preprocessing  the  kernel  A,  and  results  in  a  representation  of  A  as  a 
weighted  sum  of  a  set  of  0{k)  primitive  shapes.  Each  primitive  shape  is  an  axis-aligned 
rectangle  or  right  triangle.  The  non-axis-aligned  side  of  a  right  triangle  is  called  its 
slanted  side. 

The  second  stage  involves  preprocessing  the  image.  We  create  at  most  0{k)  sequences 
of  equally  spaced  parallel  lines,  where  each  sequence  is  either  horizontal,  vertical,  or 
parallel  to  a  side  of  K .  These  are  called  canonical  lines.  Each  sequence  of  canonical  lines 
partitions  the  image  rectangle  into  a  collection  of  thin  regions,  called  canonical  strips. 
We  digitize  each  strip  using  midpoint  digitization  and  preprocess  it  by  a  method  to  be 
described  later.  The  resulting  structure  has  the  property  that,  in  constant  time,  it  is 
possible  to  compute  the  total  weight  of  a  trapezoid  bounded  by  a  strip  and  two  lines  that 
are  either  horizontal  or  vertical.  We  will  show  that  this  structure  can  be  built  in  time 
and  space  0(mn)  for  each  side  of  K . 

After  these  preprocessing  phases,  the  third  stage  of  the  algorithm  computes  the  actual 
convolution.  It  is  based  on  computing  valid  convolutions  for  each  of  the  primitive  shapes 
and  then  summing  the  results  over  all  0{k)  shapes  to  get  the  hnal  convolution.  Eor 
any  placement  of  a  primitive  shape,  we  will  dehne  a  special  valid  digitization  called  the 
canonical  digitization.  These  digitizations  will  be  dehned  in  such  a  way  that  they  are 
pairwise  disjoint,  and  their  union  is  a  valid  digitization  of  K.  We  will  show  that  for 
each  primitive  shape,  the  weight  of  a  single  placement  of  the  shape  can  be  computed 
in  0(mn)  time.  Then  we  will  show  that  once  the  weight  of  one  placement  is  known,  it 
is  possible  to  update  the  weight  when  the  placement  is  translated  by  a  unit  distance, 
either  horizontally  or  vertically.  This  is  done  in  constant  time,  by  applying  the  structure 
described  in  the  previous  paragraph.  Thus  the  entire  convolution  for  one  shape  can  be 
computed  in  0(mn)  time  by  computing  an  initial  weight  and  then  shifting  the  shape, 
pixel  by  pixel,  over  the  entire  image.  By  applying  this  to  each  primitive  shape  and 
then  computing  the  weighted  sum  of  these  convolutions,  the  convolution  by  K  can  be 
computed  in  0{kmn)  total  time.  The  various  elements  of  the  algorithm  are  explained  in 
detail  in  the  following  subsections. 
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3.1  Decomposition  into  Primitive  Shapes 

As  mentioned  above,  the  first  stage  of  the  afgorithm  invofves  decomposing  K  into  0{k) 
primitive  shapes.  The  decomposition  is  constructed  by  hrst  encfosing  K  in  an  axis-afigned 
bounding  box  and  then  decomposing  the  difference  B\K  into  a  coUection  of  rectangfes 
and  right  triangfes.  For  each  vertex  of  K  imagine  shooting  two  buffets  horizontaffy  and 
verticaffy  away  from  the  interior  of  if,  untif  hitting  either  the  bounding  box  B  or  the 
previous  buffet’s  path.  (See  Fig  2.)  It  is  easy  to  see  that  these  buffet  paths  subdivide 
B\K  into  a  set  of  rectangles  and  right  triangles  with  pairwise  disjoint  interiors.  Together 
with  B,  these  form  the  set  of  primitive  shapes. 


Figure  2:  Decomposing  the  kernel  into  primitive  shapes. 


Let  k  denote  the  number  of  sides  of  K  and  let  r  denote  the  number  of  such  shapes. 
Observe  that  each  time  a  buffet  is  shot,  it  splits  some  region  into  two  subregions.  Since 
at  most  2k  buffets  are  shot  (at  most  two  per  vertex),  and  we  started  with  the  bounding 
box  5,  it  follows  that  r  <  2k  -\-  \  =  0(k).  Let  ifi,  1^2,  ■  ■  ■ ,  Kr  denote  these  shapes.  Let 
bnd(ifi)  denote  the  boundary  of  Ki. 

First  we  observe  that  K  can  be  expressed  as  a  weighted  sum  of  these  shapes.  The 
following  lemma  establishes  this  fact  for  all  nonboundary  points.  The  bounding  box  B 
is  assigned  a  weight  of  +f,  and  all  the  other  primitive  shapes  are  assigned  a  weight  of 
—  f.  Let  Wi  denote  the  weight  of  Ki.  Let  Ki{p)  be  f  if  p  G  and  0  otherwise.  For  all 
P  G  dehne  the  weight  of  p  to  be  LF (p)  =  ^[=1  Wi  ■  Ki{p). 


Lemma  1  For  all  p  G  p  ^  U''^;^bnd(ifi), 


if  p  G  if , 
otherwise. 


(1) 


Proof:  Points  outside  B  are  clearly  not  in  if  and  have  weight  0.  Points  within  the 
interior  of  if  have  weight  f  since  they  he  inside  B  but  outside  all  the  other  primitive 
shapes.  Because  the  interiors  of  the  primitive  shapes  other  than  B  are  pairwise  disjoint 
and  cover  B\K .,  a  point  of  B\K  that  does  not  fie  on  the  boundary  of  any  primitive  shape 
has  weight  f  —  f  =  0.  □ 

Points  that  he  on  the  boundaries  of  the  primitive  shapes  are  messier  to  deaf  with, 
since  the  result  holds  only  if  each  is  assigned  to  a  unique  primitive  shape.  We  will  deaf 
with  this  by  a  tie-breaking  rule,  which  we  term  the  nonintegrality  assumption.  We  assume 
that  for  each  t  ,  the  coordinates  of  the  vertices  of  f  +  if  are  not  integers.  With  this 
assumption,  no  grid  points  he  on  the  horizontal  or  vertical  boundaries  of  the  primitive 


5 


shapes.  This  can  be  handled  by  assnming  that  K  is  given  with  arbitrary  (possibly 
integer)  coordinates,  and  that  each  integer  coordinate  has  been  pertnrbed  by  adding 
an  inhnitesimal  valne  e  >  0.  This  can  be  implemented  symbolically  withont  actnally 
modifying  the  coordinates  of  K .  When  making  a  comparison  between  a  coordinate  and 
an  integer  qnantity,  if  the  valnes  are  not  eqnal,  the  resnlt  of  the  comparison  is  nnchanged. 
If  the  valnes  are  eqnal,  we  break  the  tie  as  if  the  coordinate  valne  has  been  increased 
inhnitesimally.  The  slanted  sides  bonnding  primitive  shapes  may  also  pass  throngh  grid 
points.  These  will  also  be  handled  by  symbolic  pertnrbation.  Bnt  this  is  not  necessary, 
becanse  onr  dehnition  of  a  valid  digitization  provides  the  freedom  to  either  select  snch  a 
point  or  not. 

3.2  Canonical  Lines  and  Canonical  Digitizations 

For  each  f  G  Z^,  and  each  primitive  shape  of  Ki,  we  dehne  a  special  digitization  of  the 
placement  i  -\-  Ki^  called  the  canonical  digitization  and  denoted  by  D^^it  -\-  Ki).  This 
will  be  done  in  snch  a  way  that  the  weighted  snm  of  these  digitizations  dehnes  a  valid 
digitization  of  the  placement  t  -\-  K . 

Consider  any  primitive  shape  Ki.  If  Ki  is  a  rectangle  then  dehne  D’^it  +  Ki)  to  be 
the  set  of  grid  points  lying  within  the  intersection  oi  t  Ki  and  the  image  rectangle  R. 
(By  onr  nonintegrality  assnmption,  no  grid  points  lie  on  the  bonndary  of  this  set.)  If 
Ki  is  a  right  triangle,  then  main  issne  is  how  to  digitize  its  slanted  side.  To  do  this  we 
introdnce  the  notion  of  a  canonical  line.  We  consider  two  cases  depending  on  the  slope  of 
KiS  slanted  side.  If  the  absolnte  valne  of  the  slope  is  less  than  1,  we  call  Ki  a  low-slope 
triangle]  otherwise,  we  call  it  a  high-slope  triangle.  Below  we  consider  the  high-slope  case. 
The  low-slope  case  is  handled  in  a  symmetrical  manner,  by  swapping  the  roles  of  the  x- 
and  y-axes. 

Let  s  denote  the  slope  of  the  slanted  side  of  Ki.  Consider  the  sorted  seqnence  of 
lines  that  intersect  the  image  rectangle  if,  have  slope  5,  and  have  x-intercept  an  integer 
mnltiple  of  0.5.  Snch  a  seqnence  is  illnstrated  in  Fig.  3  below.  Observe  that  the  hor¬ 
izontal  distance  between  two  consecntive  canonical  lines  is  0.5.  (In  the  low-slope  case, 
y-intercepts  are  nsed  instead,  and  the  vertical  spacing  is  0.5.)  These  lines  snbdivide  R 
into  a  collection  of  thin  regions,  called  canonical  strips.  (The  reader  may  wonder  abont 
the  reason  for  the  choice  of  0.5  as  the  separation  distance.  This  will  be  discnssed  later.) 
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Lemma  2  For  any  slope  s,  the  number  of  canonical  lines  and  number  of  canonical  strips 
is  0{m  +  n). 


Proof:  Assume  for  concreteness  that  s  is  positive  and  s  >  t.  The  proofs  for  the  other 
cases  follow  from  simple  symmetry.  Recall  the  dehnition  of  the  image  rectangle  R  from 
Section  2.  By  easy  calculations,  it  can  be  seen  that  any  line  of  slope  5  intersects  R  if  and 
only  if  its  x-intercept  lies  within  the  interval 


I  = 


0.5  - 


m  +  0.5 


,  n  +  0.5  ^1  - 


Figure  4:  The  range  of  canonical  lines. 

(See  Fig.  4.)  It  follows  that  the  set  of  x-intercepts  of  the  canonical  lines  for  slope  s  is 
{i/‘2  I  i  G  Z  n  2/},  where  2/  is  the  interval  whose  endpoints  have  coordinates  twice  those 
of  I.  Since  s  >  t,  the  width  of  I  satishes 

width(/)  =  n  +  0.5  ^t - ^  —  0.5  H - — — ^  <  n  +  m  +  0.5. 

An  interval  of  width  w  can  be  subdivided  into  at  most  re  +  t  strips  of  unit  length.  Since 
the  canonical  strips  have  width  0.5,  it  follows  that  the  number  of  canonical  strips  is  at 
most  2(n  +  m)  +  3.  The  number  of  canonical  lines  is  at  most  one  greater  than  this.  □ 
The  canonical  digitization  oit-\-Ki  is  dehned  as  follows.  Consider  the  line  £  supporting 
the  slanted  side  of  Ki.  If  t  does  not  intersect  the  image  rectangle  it,  then  the  intersection 
oi  t  Ki  is  either  empty  or  is  a  rectangle.  In  the  latter  case,  the  canonical  digitization 
is  dehned  to  be  the  set  of  grid  points  lying  within  this  rectangle.  Otherwise,  assume 
for  concreteness  that  the  triangle  lies  to  the  right  of  the  slanted  line.  Select  the  nearest 
canonical  line  £*  that  lies  on  or  to  the  right  of  £  (see  Fig.  5(a)).  This  can  be  accomplished 
by  computing  the  x-intercept  of  t,  and  then  rounding  to  the  next  larger  integer  multiple 
of  0.5.  In  general,  £  is  rounded  towards  the  interior  of  the  triangle  it  supports. 

The  canonical  digitization  is  dehned  to  be  the  set  of  grid  points  that  lie  within  the 
intersection  of  the  bounding  rectangle  oit-\-Ki  and  the  image  rectangle  it,  and  either  on  or 
to  the  right  of  £* .  (These  are  shown  as  black  points  in  Fig.  5(a).)  Notice  that  by  rounding 
to  the  canonical  line  lying  towards  the  interior  of  the  triangle,  the  canonical  digitization 
will  generally  consist  of  a  subset  of  the  grid  points  lying  in  t  Ki.  Thus  it  will  generally 
be  a  subset  of  the  midpoint  digitization  oi  t  Ki.  (For  example,  in  Fig.  5(a)  two  grid 
points  in  the  midpoint  digitization  have  been  excluded  from  the  canonical  digitization.) 
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Figure  5:  Canonical  digitization. 

The  canonical  digitization  for  the  entire  shape  is  dehned  by  taking  the  weighted  sum 
of  points  in  the  canonical  digitizations  of  the  primitive  shapes.  More  formally,  recalling 
the  weights  Wi  introduced  above,  we  dehne  the  canonical  digitization  oi  t  K  to  be  the 
weighted  sum  of  the  digitizations  of  f  +  ifj,  that  is, 

r 

D‘^{t  +  K)  =  J2w^-  D‘^{t  +  Ki). 

i=l 

In  other  words,  a  pixel  lies  in  the  canonical  digitization  if  the  weighted  sum  of  sets 
containing  this  pixel  is  1,  and  does  not  lie  in  it  if  this  weighted  sum  is  0.  (An  example  is 
shown  in  Fig.  5(b).  The  grid  points  belonging  to  the  hnal  digitization  are  shown  as  black 
points  in  this  hgure.)  Since  the  primitive  shapes  lie  outside  if,  observe  that  the  sides  are 
rounded  away  from  the  interior  of  if,  and  hence  the  points  of  the  canonical  digitization 
of  if  will  generally  be  a  superset  of  the  points  in  the  midpoint  digitization  of  if.  Next 
we  show  that  the  result  is  a  valid  digitization  of  if. 

Lemma  3  For  any  convex  polygon  if,  and  any  vector  t  7? ,  the  canonical  digitization 
D‘^(t  +  if)  is  a  valid  digitization  of  t  K . 

Proof:  By  the  dehnition  of  a  valid  digitization,  it  suffices  to  show  the  following  for  each 
grid  point  g  G  Z^.  Recall  that  '^[g)  is  the  pixel  (open  unit  square)  centered  at  g. 

(1)  If  T^{g)  lies  entirely  within  [t  +  if)  fl  R  then  g  is  assigned  a  weight  of  f . 

(2)  If  T^{g)  is  entirely  outside  of  (t  +  if)  fl  R  then  g  is  assigned  a  weight  of  0. 

(3)  If  T^{g)  intersects  the  boundary  of  (t  +  if)  fl  R  then  g  is  assigned  a  weight  of  either 
0  or  f. 

Recall  that  by  our  nonintegrality  assumption,  no  grid  point  lies  on  the  horizontal  or 
vertical  boundary  of  any  primitive  shape.  Also  recall  that  a  grid  point  g  is  in  the  midpoint 
digitization  of  a  shape  if  and  only  if  g  lies  in  that  shape.  To  establish  (1),  observe  that 
every  grid  point  in  (f  +  if)  fl  if  is  given  an  initial  weight  of  f  because  it  lies  within  the 
bounding  rectangle  t  -\-  B-  Furthermore,  because  each  of  the  canonical  digitizations  is 


I  I* 


(a) 
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a  subset  of  the  midpoint  digitization  for  that  shape,  no  canonical  digitization  for  any 
primitive  shape  can  contain  this  point.  Thus,  its  total  weight  is  1. 

To  establish  (2),  consider  a  grid  point  g  that  is  not  in  (t  K)  f]  R.  li  g  lies  entirely 
outside  the  bounding  box  f  +  5,  or  outside  the  image  bounding  box  if,  it  is  not  allocated 
to  any  canonical  digitization,  and  so  it  is  given  a  weight  of  0.  Otherwise,  g  lies  within 
this  intersection,  but  lies  within  some  primitive  shape  i  Ki.  We  consider  two  cases. 

First,  if  g  lies  within  the  canonical  digitization  of  this  shape,  we  assert  that  g  will 
not  be  in  the  canonical  digitization  of  any  other  primitive  shape  (other  than  B)  and 
hence  it  will  be  assigned  a  weight  of  1  —  1  =  0.  To  see  this,  hrst  recall  that  (ignoring 
the  bounding  box  B)  the  primitive  shapes  have  disjoint  interiors  and  have  no  grid  points 
on  their  boundaries.  Thus  the  midpoint  digitizations  of  the  shapes  are  pairwise  disjoint. 
Recall  that  each  canonical  digitization  is  a  subset  of  a  midpoint  digitization,  and  hence 
they  too  are  pairwise  disjoint,  implying  that  g  is  in  no  other  canonical  digitization. 

The  second  case  is  when  the  point  g  is  not  in  the  canonical  digitization  oi  t  Ki.  We 
claim  that  this  cannot  happen  under  the  hypothesis  of  (2).  Since  g  lies  within  (f +  ifj)nif, 
and  since  there  are  no  grid  points  on  the  horizontal  and  vertical  sides  of  primitive  shapes, 
g  fails  to  lie  in  the  canonical  digitization  because  it  lies  outside  the  associated  canonical 
line.  If  this  is  a  high-slope  line,  the  canonical  line  lies  within  a  horizontal  distance  of 
0.5  of  the  supporting  line  for  the  slanted  side.  Since  the  point  lies  outside  the  canonical 
line,  but  inside  the  supporting  line  (since  it  is  inside  the  primitive  shape),  it  follows  that 
the  horizontal  distance  from  the  point  to  the  slanted  side’s  supporting  line  is  at  most 
0.5.  However,  this  together  with  the  fact  that  g  lies  in  t  Ki  implies  that  the  associated 
side  of  K  intersects  g'’s  pixel.  This  implies  that  g'’s  pixel  does  not  lie  entirely  outside  K 
as  hypothesized.  (In  the  case  of  a  low  slope  the  argument  is  the  same,  but  the  vertical 
distance  is  at  most  0.5.)  This  establishes  (2). 

Finally,  to  show  (3),  consider  a  pixel  TT{g)  that  intersects  the  boundary  of  {t  K)  D  R. 
If  g  lies  outside  the  bounding  box  (f  +  H)  fl  if,  it  is  assigned  a  weight  of  0.  Otherwise,  g 
will  be  assigned  an  initial  weight  of  f  because  it  lies  inside  the  bounding  box.  We  claim 
that  g  can  be  in  the  canonical  digitization  of  at  most  one  other  primitive  shape.  This  is 
because  (by  our  nonintegrality  assumption)  g  can  lie  in  at  most  one  primitive  shape,  and 
hence  it  lies  in  the  midpoint  digitization  of  at  most  one  primitive  shape.  Because  the 
canonical  digitization  of  a  shape  is  a  subset  of  the  midpoint  digitization,  g  lies  in  at  most 
one  canonical  digitization.  If  g  is  in  some  such  canonical  digitization,  its  hnal  weight  is 
0,  and  otherwise  its  weight  is  1.  This  establishes  (3).  □ 

The  reason  for  rounding  lines  toward  the  interior  of  the  primitive  shape  triangles  and 
the  choice  of  0.5  as  the  separation  distance  between  canonical  lines  is  now  clear.  The 
proof  of  (2)  relied  on  the  fact  that  the  horizontal  distance  is  half  the  width  of  a  pixel.  The 
proof  of  (3)  relied  on  the  fact  that  canonical  digitizations  are  subsets  of  the  associated 
midpoint  digitizations.  If  this  were  not  the  case,  a  pixel  interior  to  K  but  intersected  by 
two  sides  of  K  (say  one  on  its  left  and  one  on  its  right)  might  be  assigned  to  two  canonical 
digitizations.  The  resulting  weight  of  the  associated  grid  point  would  bef  —  1  —  1  =  — 1, 
and  this  does  not  correspond  to  any  valid  digitization  of  K . 
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3.3  Updating  Canonical  Digitizations 

The  main  algorithmic  tasks  needed  to  compute  the  digitization  are  (1)  how  to  compute 
the  weight  of  the  canonical  digitization  of  a  single  placement  of  a  primitive  shape,  and 
(2)  how  to  update  the  weight  of  the  canonical  digitization  when  the  shape  is  shifted  by 
one  unit  distance,  either  horizontally  or  vertically.  The  hrst  task  can  be  accomplished 
by  adapting  any  standard  algorithm  for  digitizing  convex  polygons  [6].  The  second  task 
will  be  addressed  in  the  remainder  of  this  section. 

Rectangular  Shapes.  Let  us  consider  the  case  of  a  rectangular  primitive  shape  iL*, 
since  it  is  the  simplest.  For  concreteness  we  consider  the  case  of  a  translation  to  the  right 
by  one  unit.  Let  t  and  t'  be  two  placement  vectors  such  that  t'  —  t  =  (1,0).  (The  other 
unit  shifts  are  handled  similarly.)  We  assume  that  the  weight  of  the  canonical  digitization 
oi  t  Ki  is  known,  and  we  want  to  compute  the  weight  of  the  canonical  digitization  of 
i'  Ki.  First,  observe  that  the  symmetric  difference  between  (t  +  Ki)  and  (T  +  Ki)  is  the 
union  of  two  congruent  rectangles  each  of  unit  width,  the  left  one  r  belonging  to  t  Ki 
and  the  right  one  r'  belonging  to  t'  +  Ki.  (See  Fig.  6(a).)  If  the  width  of  Ki  is  less  than 
1,  we  can  think  of  these  unit-width  rectangles  as  overlapping  each  other.  The  change  in 
the  weight  between  the  two  placements  is  easily  seen  to  be  the  difference  between  the 
weights  of  r'  and  r.  Thus  we  have 

+  K,))  -  +  Ki))  =  w{D‘^{r'))  -  w{D‘^{r)). 

The  incremental  change  in  weight  can  be  computed  in  constant  time  once  we  know 
the  weights  of  r  and  r' .  There  is  a  simple  way  to  do  this.  First  we  preprocess  the  image. 
For  each  column  and  each  grid  point  we  store  the  total  weight  of  the  image  points  in 
that  same  column  that  have  equal  or  smaller  y-values.  Following  common  usage  in  the 
held  of  parallel  algorithms,  we  call  this  a  prefix  sum.  (Figs.  6(b)  and  (c)  show  the  image 
weights  and  the  column  prehx  sums,  respectively.)  Then  to  compute  the  weight  within 
the  column  rectangle,  we  take  the  difference  between  the  prehx  sums  of  the  topmost  grid 
point  in  the  rectangle  and  the  grid  point  just  below  the  bottom  of  the  rectangle  (or  zero, 
if  the  rectangle  extends  to  the  lowest  pixel  in  the  column).  In  Fig.  6,  the  weights  of  r 
and  r'  are  4  —  f  =  3  and  6  —  2  =  4,  respectively.  Thus  the  net  change  in  weight  between 
placements  t  and  T  is  4  —  3  =  1. 

The  prehx  sums  for  each  column  can  be  computed  by  a  simple  scan  in  0(m)  time, 
implying  that  all  the  prehx  sums  can  be  computed  in  0{mn)  total  time.  The  weight 
of  the  canonical  digitization  of  any  rectangle  of  width  f  can  be  computed  in  constant 
time  by  rounding  its  x-coordinates  to  determine  the  grid  column  that  it  spans,  and  then 
rounding  its  y-coordinates  to  determine  the  elements  of  the  prehx  sum  whose  difference 
is  to  be  taken. 

A  vertical  unit-length  translation  is  handled  similarly,  but  it  results  in  two  rectangles 
of  unit  vertical  height.  The  preprocessing  for  this  case  consists  of  computing  prehx  sums 
for  each  of  the  rows. 

Triangular  Shapes:  Horizontal  Translation.  Next  we  consider  how  to  update  the 
weight  of  the  canonical  digitization  of  the  placement  of  a  right-triangle  primitive  shape 
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Figure  6:  Updating  weights  for  the  translation  of  a  rectangle. 


Ki.  We  will  assume  that  the  slanted  side  of  Ki  has  a  slope  that  is  positive  and  at  least 
1.  The  cases  for  negative  and/or  low  slopes  are  handled  similarly. 

Let  us  hrst  consider  the  case  of  a  horizontal  translation;  we  will  consider  vertical 
translations  later.  As  before,  let  t-\-Ki  denote  the  current  placement  of  whose  weight 
we  know,  and  let  t'  +  Ki  be  the  new  placement,  whose  weight  we  wish  to  compute.  Let 
t'  —  t  =  (1,  0).  (See  Fig.  7(a).  Assume  that  the  hgure  shows  the  triangles  after  the  slanted 
sides  have  been  rounded  to  the  appropriate  canonical  lines.)  ft  is  easy  to  see  that  the  net 
change  in  weight  can  be  expressed  as  the  sum  and  difference  of  the  two  lightly  shaded 
trapezoids  shown  in  the  hgure.  One  trapezoid  has  vertical  parallel  sides  aligned  with  the 
right  side  of  the  shape  and  is  added.  The  other  has  parallel  sides  aligned  with  the  slanted 
side  and  is  subtracted.  The  dark  shaded  region  shown  in  the  hgure  is  both  added  and 
subtracted,  so  its  net  weight  change  is  zero. 


The  horizontal  width  of  the  trapezoid  along  the  right  side  is  exactly  one  unit,  and 
hence  it  spans  exactly  one  vertical  column  of  grid  points.  Its  adjusted  weight  can  be 
computed  using  the  same  method  described  earlier  for  rectangles.  In  particular,  assuming 
that  the  prehx  sums  for  the  columns  have  been  computed,  it  sufhces  to  determine  the 
topmost  grid  point  of  the  column  lying  within  the  trapezoid  and  the  topmost  grid  point 
of  the  column  lying  immediately  below  the  trapezoid.  The  only  added  complication  here 
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is  that  the  top  side  of  the  trapezoid  is  not  horizontal,  as  it  was  in  the  rectangle  case. 
Nonetheless,  by  compnting  the  y-coordinate  of  the  intersection  point  of  the  slanted  top 
side  of  the  trapezoid  with  the  line  containing  the  grid  points  and  ronnding  to  the  next 
smaller  integer  valne,  we  can  compnte  this  grid  point  and  its  prehx  weight  in  constant 
time.  (For  example,  the  weight  of  the  vertical  trapezoid  shown  in  Fig.  7(c)  is  6  —  2  =  4.) 

Next  we  consider  the  canonical  digitization  of  the  slanted  trapezoid.  We  assert  that 
the  horizontal  distance  between  the  slanted  sides  in  the  canonical  digitization  of  this 
trapezoid  is  exactly  one  nnit.  This  follows  from  the  facts  that  (f)  the  original  slanted 
lines  of  the  triangle  before  and  after  translation  are  separated  by  a  distance  of  one  nnit, 

(2)  the  canonical  lines  for  high  slopes  are  separated  by  a  horizontal  distance  of  0.5,  and 

(3)  both  slanted  lines  are  ronnded  in  the  same  direction. 

Becanse  the  horizontal  width  of  the  slanted  trapezoid  is  one  nnit,  it  spans  exactly 
two  canonical  strips.  To  compnte  the  weight  of  the  trapezoid,  it  snffices  to  compnte  the 
snm  weights  of  the  pixels  lying  in  the  two  snbtrapezoids  dehned  by  the  intersections  of 
the  two  canonical  strips  and  the  trapezoid.  First,  we  preprocess  the  image  by  compnting 
the  prehx  sums  within  each  canonical  strip.  (This  is  shown  in  Fig.  7(d).)  Once  these 
prehx  snms  have  been  compnted,  we  compnte  the  difference  between  the  prehx  snm  of 
the  the  topmost  grid  point  in  each  snbtrapezoid  and  the  topmost  grid  point  in  the  strip 
lying  jnst  below  each  snbtrapezoid.  (In  the  hgnre,  the  resnlt  is  (4  —  0)  +  (3  —  1)  =  6.) 

ffere  is  a  more  detailed  explanation  of  the  process.  For  each  row  of  the  hgnre  and  for 
each  canonical  strip  the  preprocessing  phase  compntes  the  weighted  snm  of  the  grid  points 
lying  on  or  below  this  row  in  the  strip.  (For  the  low-slope  case,  the  snms  are  associated 
with  the  colnmns  of  the  image.)  Notice  that  not  every  row  has  a  grid  point  lying  in  the 
strip.  This  issne  will  be  addressed  in  the  proof  of  Lemma  4.  To  compnte  the  points  lying 
jnst  below  the  bottoms  of  the  snbtrapezoids  in  each  strip,  compnte  the  y-coordinate  of 
the  bottom  side  of  Ki  +  f,  and  then  ronnd  down  to  the  next  smaller  integer  to  get  the 
appropriate  row.  To  compnte  the  topmost  grid  point  in  a  snbtrapezoid,  hrst  ronnd  the 
right  vertical  side  of  the  trapezoid  to  the  next  smaller  integer  x-coordinate,  so  that  it 
coincides  with  a  column  of  grid  points.  Compnte  the  y-coordinate  of  the  intersection  of 
this  vertical  line  with  the  top  line  of  the  canonical  strip.  Ronnd  this  y-coordinate  down  to 
the  next  smaller  integer,  ft  is  not  hard  to  see  that  any  grid  point  lying  in  the  snbtrapezoid 
mnst  lie  on  or  below  this  row.  This  process  is  performed  for  both  snbtrapezoids,  and  the 
two  weights  are  added  together. 

Triangular  Shapes:  Vertical  Translation.  The  last  case  to  be  considered  is  the 
incremental  change  in  the  weight  of  a  right  triangle  primitive  shape,  again  with  a  high 
slope,  bnt  in  the  case  of  a  vertical  translation.  Snppose  that  the  triangle  is  translated 
vertically  npward  by  one  nnit,  from  placement  t  to  T,  where  t'  —  t  =  (0, 1).  (See  Fig.  8(a).) 
Let  s  denote  the  slope  of  the  slanted  side.  As  in  the  horizontal  case,  the  change  in  weight 
can  be  expressed  nsing  the  weights  of  two  trapezoids.  In  this  case  one  trapezoid  is  aligned 
with  the  bottom  of  Ki  and  the  other  along  the  slanted  side.  The  weight  of  the  bottom 
trapezoid  can  be  handled  in  a  symmetrical  way  to  what  was  done  in  the  horizontal  case. 

The  slanted  side  is  somewhat  different,  thongh.  A  convenient  property  of  the  hor¬ 
izontal  case  was  that  the  slanted  trapezoid  spanned  exactly  two  canonical  strips,  and 
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Figure  8:  Updating  weights  for  the  vertical  translation  of  a  triangle. 

hence  the  computation  could  be  broken  down  into  computing  the  weights  of  exactly  two 
subtrapezoids.  However,  the  horizontal  distance  between  a  high-slope  line  and  its  vertical 
translate  of  one  unit  is  f/s,  which  is  not  in  general  an  integer.  We  claim  that  this  does 
not  result  in  any  signihcant  added  complexity.  In  this  case,  since  the  horizontal  distance 
between  the  two  canonical  lines  is  f/s,  and  since  s  >  1,  it  follows  that  the  horizontal 
distance  between  the  slanted  lines  is  at  most  1.  Because  the  two  lines  are  rounded  in 
the  same  direction,  after  rounding,  the  slanted-side  trapezoid  will  generally  span  either 
0,  1,  or  2  canonical  strips.  Thus  the  number  of  subtrapezoids  is  not  hxed,  but  will  either 
be  0,  1,  or  2.  (For  example,  in  Fig.  8(b)  there  is  only  one  subtrapezoid.)  Thus,  the 
procedure  described  for  the  horizontal  case  can  be  applied  here  as  well.  The  only  change 
is  to  determine  the  actual  number  of  canonical  strips  that  are  spanned  by  the  slanted 
sides  after  rounding. 

Summary.  The  results  of  this  section  are  summarized  in  the  following  two  lemmas. 

Lemma  4  The  preprocessing  for  all  canonical  strips  for  a  given  slope  can  be  done  in 
0(nm)  time  and  0(nm)  space. 

Proof:  We  consider  the  case  of  high  slopes.  Low  slopes  follow  from  a  symmetrical 
argument,  and  horizontal  and  vertical  strips  have  already  been  discussed.  By  Lemma  2 
the  number  of  canonical  strips  for  any  slope  is  0{m  +  n).  We  will  establish  the  result 
hrst  for  strips  of  horizontal  width  1,  and  then  modify  this  for  strips  of  width  0.5. 

For  each  canonical  strip  of  width  f,  each  row  of  the  image  intersects  the  strip  in  a 
single  grid  point  (assuming  consistent  rounding).  We  can  determine  these  grid  points  by 
computing  the  point  at  which  the  canonical  line  intersects  the  image  rectangle  if,  and 
then  applying  an  appropriate  modihcation  of  any  standard  line  digitization  algorithm, 
(for  example,  Bresenham’s  algorithm  [1,  6])  to  the  canonical  lines  dehning  the  boundary 
of  the  strip.  As  we  digitize  the  line  we  can  maintain  the  prehx  sum  and  assign  it  to  each 
pixel  in  the  strip.  Such  an  algorithm  takes  time  proportional  to  the  number  of  pixels 
processed  within  the  strip.  The  total  time  over  all  strips  will  be  equal  to  the  number 
of  strips  plus  the  total  number  of  pixels  processed.  Because  the  canonical  strips  for  any 
slope  cover  every  pixel  in  the  image  exactly  once,  it  follows  that  the  total  time  to  process 
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all  the  strips  for  this  slope  will  be  0(mn).  Since  each  pixel  holds  a  single  prehx  valne 
the  space  is  also  0(mn). 

We  can  modify  this  scheme  for  canonical  strips  of  width  0.5  as  follows.  Snppose  we 
have  already  digitized  the  canonical  strips  of  nnit  width.  For  each  snch  strip,  we  digitize 
the  canonical  line  lying  midway  between  the  sides  of  the  strip,  determining  which  grid 
points  of  the  strip  lie  on  the  left  side  and  which  lie  on  the  right  side  of  this  midway  line. 
Again  this  can  be  done  nsing  any  line  digitization  algorithm.  We  replace  the  single  prehx 
snm  valne  in  each  pixel  with  a  pair  of  valnes,  one  for  the  left  snbstrip  and  one  for  the 
right  snbstrip.  Since  each  row  of  the  original  strip  contribnted  a  single  grid  point  to  the 
strip,  there  will  be  an  entry  for  each  snbstrip  and  for  each  row.  As  before,  this  can  be 
performed  in  0{mn)  time  and  space.  □ 

The  following  lemma  has  already  been  established  in  the  conrse  of  this  section. 

Lemma  5  After  preprocessing  has  been  completed,  if  the  weight  of  the  canonical  digitiza¬ 
tion  of  a  placement  of  a  primitive  shape  t  +  Ki  is  known,  then  the  weight  of  the  canonical 
digitization  of  any  unit-length  translation  of  the  shape,  either  horizontally  or  vertically, 
can  be  computed  in  constant  time. 

3.4  The  Canonical  Convolution  Algorithm 

We  can  now  complete  the  description  of  the  algorithm  for  compnting  the  approximate 
convolntion.  As  mentioned  before,  the  algorithm  operates  by  compnting  the  weights  of 
the  canonical  digitizations  for  each  of  the  0{k)  primitive  shapes,  and  then  compnting 
the  weighted  snm  over  all  these  shapes.  Lemma  3  states  that  the  resnlting  snm  is  a 
valid  digitization,  and  hence  the  resnlting  convolntion,  called  the  canonical  convolution, 
is  a  valid  convolntion.  Here  is  the  entire  algorithm,  which  is  given  the  inpnt  image 
/[!, .  .  . ,  m,  1, .  .  . ,  n]  and  the  kernel  polygon  K  with  k  sides. 

(1)  Using  the  method  of  Section  3.f,  snbdivide  K  into  r  =  0{k)  primitive  shapes 
Ki,  K2, .  .  . ,  Kr  with  associated  weights  Wi. 

(2)  Compnte  the  prehx  snms  for  the  rows  and  colnmns  of  the  image  in  0{mn)  time. 
Initialize  the  convolntion  image  C[\, .  .  . ,  m,  1, .  .  . ,  n]  to  0. 

(3)  For  i  from  f  to  r,  perform  the  following  steps: 

(a)  By  brnte  force  compnte  the  canonical  digitization  of  at  fo  =  (1,  !)• 

(b)  If  Ki  is  a  right  triangle,  let  s  denote  the  slope  of  its  slanted  side.  Compnte  the 
canonical  lines  for  this  slope  (as  dehned  in  Section  3.2)  and  compnte  prehx 
snms  for  the  resnlting  canonical  strips  nsing  the  method  ontlined  in  Lemma  4. 

(c)  For  the  translation  vector  t  zig-zagging  along  the  grid  points  of  the  rows  and 
colnmns  of  the  image  rectangle  R,  incrementally  npdate  the  weight  of  the 
canonical  digitization  of  the  new  placement  of  Ki,  nsing  the  methods  de¬ 
scribed  in  Section  3.3.  From  Lemma  5  we  know  that  each  npdate  can  be 
compnted  in  constant  time.  Add  each  of  the  resnlting  weights,  times  Wi,  into 
the  convolntion  image  C[t]. 
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The  correctness  of  this  procedure  has  been  established  in  the  previous  discussion.  The 
running  time  of  step  (f)  is  0{k).  Step  (2)  can  be  performed  in  0{mn)  time.  Step  (3a) 
can  be  performed,  by  any  algorithm  for  digitizing  convex  polygons  [6,  8].  The  running 
time  of  such  an  algorithm  is  proportional  to  the  number  of  pixels  covered  by  if  fl  if,  and 
this  can  be  at  most  0{mn).  By  Lemma  4,  step  (3b)  also  takes  0{mn)  time.  Step  (3c) 
takes  0{mn)  time,  since  each  update  takes  constant  time,  and  the  shape  is  incrementally 
shifted  to  each  of  the  mn  grid  points  in  the  image.  Since  it  is  repeated  for  each  of  the 
0{k)  primitive  shapes,  step  (3)  takes  total  time  0{kmn). 

As  mentioned  in  Lemma  4,  the  space  requirements  are  0{mn)  per  slope.  Since  we  can 
discard  the  prehx  sums  computed  in  step  (3b)  after  their  use  in  step  (3c),  we  need  to  keep 
only  three  copies  of  the  prehx  sums  at  any  time  (one  for  the  slanted  slope,  one  for  the 
rows,  and  one  for  the  columns).  Thus  the  total  space  requirements  are  0(3mn)  =  0{mn). 
This  establishes  our  main  result.  Theorem  1. 

4  Conclusions 

We  have  presented  an  efhcient  algorithm  for  computing  approximate  convolutions  for 
binary  kernels  that  are  modeled  as  convex  &-sided  polygons.  Our  algorithm  runs  in 
0(kmn)  time  on  an  m  X  n  image,  irrespective  of  the  area  or  perimeter  of  the  kernel. 
Our  approach  is  based  on  a  special  type  of  digitization  of  the  kernel,  called  a  canonical 
digitization,  which  varies  from  one  placement  to  the  next.  We  have  shown  that  canonical 
digitizations  can  be  updated  efhciently  through  the  use  of  prehx  sums. 

Some  interesting  open  problems  are  suggested  by  this  work.  One  question  is  whether 
these  techniques  can  be  generalized  to  convolutions  involving  kernels  that  are  multi¬ 
valued,  or  to  nonconvex  simple  polygons.  In  theory,  such  a  kernel  could  be  subdivided 
into  convex  parts.  However,  Lemma  3,  which  establishes  the  validity  of  the  canonical 
digitization,  does  not  generalize  immediately  to  collections  of  convex  polygons.  Another 
question  is:  If  approximate  convolutions  are  used  to  approximate  morphological  opera¬ 
tions  (such  as  dilation),  what  can  be  said  about  the  properties  of  the  resulting  shapes, 
as  compared  with  their  exact  counterparts? 
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